Efficient Parallel Algorithms for Hierarchically Semiseparable Matrices
نویسندگان
چکیده
Recently, hierarchically semiseparable (HSS) matrices have been used in the development of fast direct sparse solvers. Key applications of HSS algorithms, coupled with multifrontal solvers, appear in solving certain large-scale computational inverse problems. Here, we develop massively parallel HSS algorithms appearing in these solution methods, namely, parallel HSS construction using the rank revealing QR (RRQR) method, parallel HSS ULV factorization, and parallel HSS solution. HSS representations have a nice binary tree structure called HSS tree. HSS operations can be conducted following the traversal of this tree, and communications are generally limited to between siblings and between parents and children. Thus, HSS algorithms are often highly scalable. BLACS [1] and ScaLapack [3] are used as our porTable libraries. We construct contexts (sub-communicators) on each node of the HSS tree and exploit the governing 2D-block-cyclic data distribution scheme widely used in ScaLapack. Computational examples confirm the weak scaling, strong scaling and accuracy of our implementation.
منابع مشابه
Efficient Scalable Algorithms for Hierarchically Semiseparable Matrices
Hierarchically semiseparable (HSS) matrix algorithms are emerging techniques in constructing the superfast direct solvers for both dense and sparse linear systems. Here, we develope a set of novel parallel algorithms for the key HSS operations that are used for solving large linear systems. These include the parallel rank-revealing QR factorization, the HSS constructions with hierarchical compr...
متن کاملFast algorithms for hierarchically semiseparable matrices
Semiseparable matrices and many other rank-structured matrices have been widely used in developing new fast matrix algorithms. In this paper, we generalize the hierarchically semiseparable (HSS) matrix representations and propose some fast algorithms for HSS matrices. We represent HSS matrices in terms of general binary HSS trees and use simplified postordering notation for HSS forms. Fast HSS ...
متن کاملAn efficient hybrid tridiagonal divide-and-conquer algorithm on distributed memory architectures
In this paper, an efficient divide-and-conquer (DC) algorithm is proposed for the symmetric tridiagonal matrices based on ScaLAPACK and the hierarchically semiseparable (HSS) matrices. HSS is an important type of rankstructured matrices. Most time of the DC algorithm is cost by computing the eigenvectors via the matrix-matrix multiplications (MMM). In our parallel hybrid DC (PHDC) algorithm, MM...
متن کاملEfficient Scalable Algorithms for Solving Dense Linear Systems with Hierarchically Semiseparable Structures
Hierarchically semiseparable (HSS) matrix techniques are emerging in constructing superfast direct solvers for both dense and sparse linear systems. Here, we develop a set of novel parallel algorithms for key HSS operations that are used for solving large linear systems. These are parallel rank-revealing QR factorization, HSS constructions with hierarchical compression, ULV HSS factorization, a...
متن کاملTransforming a hierarchical into a unitary-weight representation
In this paper we consider a class of hierarchically rank structured matrices, including some of the hierarchical matrices occurring in the literature, such as hierarchically semiseparable (HSS) and certain H∈-matrices. We describe a fast O(rn log(n)) and stable algorithm to transform this hierarchical representation into a so-called unitary-weight representation, as introduced in an earlier wor...
متن کامل